# Backend Server Design
This document describes the backend server design, built with FastAPI and a complementary MCP server. It explains the application structure, routing organization, service layer, and the LLM provider abstraction supporting multiple AI models. It documents configuration management across environments, the modular router system for API endpoints, request/response handling patterns, middleware, and error-handling strategies, as well as the separation between API endpoints and business logic, dependency injection, concurrency handling, external service integrations, caching, performance optimization, scalability, load balancing, and monitoring.
The backend consists of:
- Application entrypoint that selects between API and MCP modes
- FastAPI application with modular routers and a central run script
- Core configuration and LLM abstraction
- Routers for each domain endpoint
- Services implementing business logic
- Agents orchestrating multi-step reasoning with tools
- MCP server exposing tools for external clients
```mermaid
flowchart TD
    Entry["Entry Point<br/>main.py"] --> ModeSel{"Mode Selection"}
    ModeSel --> |API| APIApp["FastAPI App<br/>api/main.py"]
    ModeSel --> |MCP| MCP["MCP Server<br/>mcp_server/server.py"]
    APIApp --> Routers["Routers<br/>routers/*"]
    APIApp --> Config["Config<br/>core/config.py"]
    APIApp --> Run["Uvicorn Runner<br/>api/run.py"]
    Routers --> Services["Services<br/>services/*"]
    Services --> Agents["Agents<br/>agents/react_agent.py"]
    Services --> LLM["LLM Abstraction<br/>core/llm.py"]
```
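The mode selection at the entry point can be sketched as follows. This is an illustrative stand-in, not the actual `main.py`: the flag names (`--mode`, `--non-interactive`) and the dispatch targets are assumptions.

```python
# Hypothetical sketch of main.py's mode selection.
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Backend entry point")
    parser.add_argument("--mode", choices=["api", "mcp"], default="api",
                        help="run the FastAPI server or the MCP stdio server")
    parser.add_argument("--non-interactive", action="store_true",
                        help="disable interactive prompts where supported")
    return parser


def select_mode(argv=None) -> str:
    args = build_parser().parse_args(argv)
    if args.mode == "api":
        # would hand off to the Uvicorn runner (api/run.py)
        return "api"
    # would start the MCP stdio server (mcp_server/server.py)
    return "mcp"
```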
- Entry point and mode selection: supports running as API server or MCP server with optional interactive or non-interactive mode.
- FastAPI application: defines routes under multiple prefixes and includes health, GitHub, website, YouTube, Google Search, Gmail, Calendar, PyJIIT, React agent, website validator, agent, and file upload endpoints.
- Configuration: environment-driven settings for host, port, debug level, and Google API key; centralized logger factory.
- LLM abstraction: provider-agnostic initialization and generation interface supporting Google, OpenAI, Anthropic, Ollama, DeepSeek, and OpenRouter.
- Routers: per-domain endpoints with dependency injection of services and standardized error handling.
- Services: business logic implementations for website QA, React agent orchestration, and related tasks.
- Agents: LangGraph-based reasoning with tool execution and message normalization.
- MCP server: exposes tools for LLM generation, GitHub Q&A, website markdown fetching, and HTML-to-markdown conversion.
The system separates concerns across layers:
- Presentation: FastAPI routers define endpoints and handle request validation and error mapping.
- Business logic: services encapsulate domain-specific workflows.
- Orchestration: agents coordinate tool use and multi-step reasoning.
- Infrastructure: configuration and the LLM abstraction provide pluggable providers and runtime settings.
```mermaid
flowchart TD
    subgraph "Presentation Layer"
        RWebsite["Website Router<br/>routers/website.py"]
        RReact["React Agent Router<br/>routers/react_agent.py"]
        RHealth["Health Router<br/>routers/health.py"]
    end
    subgraph "Business Logic Layer"
        SWebsite["Website Service<br/>services/website_service.py"]
        SReact["React Agent Service<br/>services/react_agent_service.py"]
    end
    subgraph "Orchestration Layer"
        Agent["React Agent Graph<br/>agents/react_agent.py"]
    end
    subgraph "Infrastructure"
        LLM["LLM Provider Abstraction<br/>core/llm.py"]
        CFG["Environment Config<br/>core/config.py"]
    end
    RWebsite --> SWebsite
    RReact --> SReact
    SWebsite --> LLM
    SReact --> Agent
    Agent --> LLM
    RWebsite --> CFG
    RReact --> CFG
    RHealth --> CFG
```
## FastAPI Application and Routing Organization
- Central app definition with title and version.
- Modular router inclusion under distinct prefixes for health, GitHub, website, YouTube, Google Search, Gmail, Calendar, PyJIIT, React agent, website validator, agent, and file upload.
- Root endpoint returns app metadata.
```mermaid
flowchart TD
    App["FastAPI App<br/>api/main.py"] --> Prefixes["Route Prefixes"]
    Prefixes --> H["/api/genai/health"]
    Prefixes --> G["/api/genai/github"]
    Prefixes --> W["/api/genai/website"]
    Prefixes --> Y["/api/genai/youtube"]
    Prefixes --> GS["/api/google-search"]
    Prefixes --> GM["/api/gmail"]
    Prefixes --> C["/api/calendar"]
    Prefixes --> P["/api/pyjiit"]
    Prefixes --> RA["/api/genai/react"]
    Prefixes --> V["/api/validator"]
    Prefixes --> A["/api/agent"]
    Prefixes --> U["/api/upload"]
```
## Request/Response Handling Patterns and Error Handling
- Routers validate inputs and delegate to services via dependency injection.
- Standardized try/except blocks log errors and raise HTTP exceptions with appropriate status codes.
- Responses are typed via Pydantic models where applicable.
```mermaid
sequenceDiagram
    participant Client as Client
    participant Router as Website Router<br/>routers/website.py
    participant Service as Website Service<br/>services/website_service.py
    Client->>Router: POST /api/genai/website
    Router->>Router: Validate request<br/>Extract url, question, chat_history
    Router->>Service: generate_answer(url, question, chat_history, client_html)
    Service-->>Router: Answer string or error
    alt Success
        Router-->>Client: 200 OK with answer
    else Validation Error
        Router-->>Client: 400 Bad Request
    else Internal Error
        Router-->>Client: 500 Internal Server Error
    end
```
## Dependency Injection and Service Layer
- Routers define dependency factories returning service instances.
- Services encapsulate domain logic and integrate with tools and LLM providers.
- Example: the website router depends on WebsiteService; the React agent router depends on ReactAgentService.
## LLM Provider Abstraction
- Provider configurations map provider names to LangChain classes and parameter mappings.
- Initialization validates provider support, model defaults, API keys, and base URLs.
- The generation method constructs system and human messages and invokes the underlying client.
- A default LLM instance is created for application-wide use.
## MCP Server Integration
- Defines tools for LLM generation, GitHub Q&A, website markdown fetching, and HTML-to-markdown conversion.
- Implements tool dispatch based on tool name and arguments.
- Runs over stdio using the MCP server framework.
```mermaid
sequenceDiagram
    participant Ext as External Client
    participant MCP as MCP Server<br/>mcp_server/server.py
    participant LLM as LargeLanguageModel<br/>core/llm.py
    Ext->>MCP: call_tool("llm.generate", args)
    MCP->>LLM: initialize with provider/model/api_key/base_url
    MCP->>LLM: generate_text(prompt, system_message)
    LLM-->>MCP: response text
    MCP-->>Ext: TextContent(text)
```
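Name-based tool dispatch can be sketched as a handler table. The `llm.generate` name appears in the flow above; the markdown tool name and the handler bodies are stand-in assumptions.

```python
# Sketch of name-based tool dispatch in an MCP-style server.
async def handle_llm_generate(args: dict) -> str:
    # Would initialize the LLM abstraction and call generate_text(...).
    return f"generated: {args['prompt']}"


async def handle_fetch_markdown(args: dict) -> str:
    # Would fetch the page and convert its HTML to markdown.
    return f"markdown for {args['url']}"


TOOL_HANDLERS = {
    "llm.generate": handle_llm_generate,
    "website.fetch_markdown": handle_fetch_markdown,  # hypothetical name
}


async def call_tool(name: str, args: dict) -> str:
    handler = TOOL_HANDLERS.get(name)
    if handler is None:
        raise ValueError(f"Unknown tool: {name}")
    return await handler(args)
```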
## React Agent Orchestration
- Converts chat history and optional client HTML into LangGraph messages.
- Builds a graph with an agent node and a tool-execution node, conditionally routing between them.
- Uses cached graph compilation for performance.
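The conditional routing between the agent node and the tool node can be sketched with a simplified message shape (plain dicts here; the real code uses LangChain message objects):

```python
# Sketch of the agent/tool routing decision used by the graph.
def route_after_agent(messages: list[dict]) -> str:
    """Return 'tools' when the last agent message requests tool calls,
    otherwise 'end' to finish the run."""
    last = messages[-1]
    return "tools" if last.get("tool_calls") else "end"
```

In LangGraph this function would be registered as the conditional edge from the agent node, looping back through the tool node until no tool calls remain.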
## Configuration Management
- Environment variables drive host, port, debug level, and the Google API key.
- The logging level is derived from the debug flag.
- A centralized logger factory ensures consistent logging across modules.
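A sketch of these settings, assuming hypothetical environment-variable names (the real `core/config.py` may use different names and defaults):

```python
# Illustrative environment-driven settings and logger factory.
import logging
import os

HOST = os.environ.get("HOST", "0.0.0.0")
PORT = int(os.environ.get("PORT", "8000"))
DEBUG = os.environ.get("DEBUG", "false").lower() in ("1", "true", "yes")
GOOGLE_API_KEY = os.environ.get("GOOGLE_API_KEY", "")

# Logging level derived from the debug flag.
LOG_LEVEL = logging.DEBUG if DEBUG else logging.INFO


def get_logger(name: str) -> logging.Logger:
    """Centralized logger factory so every module logs consistently."""
    logger = logging.getLogger(name)
    logger.setLevel(LOG_LEVEL)
    return logger
```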
## Middleware Implementation
No explicit middleware is defined in the analyzed files. Logging is handled via module loggers and exception handlers in routers.
## Concurrency and Request Handling
FastAPI uses async route handlers; services implement async methods for I/O-bound operations (external APIs, LLM calls).
LangGraph invocation is awaited, ensuring cooperative concurrency.
## External Service Integrations
- Website service integrates markdown fetching and HTML-to-markdown conversion.
- React agent service optionally uploads files to Google GenAI and uses LangChain messages.
- MCP server integrates with LangChain providers and website tools.
## Caching Strategies
The LangGraph graph is compiled once and cached via a cached graph factory, avoiding the cost of rebuilding and recompiling the graph on every invocation.
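The cached-factory idea can be sketched with `functools.lru_cache`; `_build_graph()` below is a stand-in for wiring the LangGraph nodes and calling `StateGraph.compile()`, and the function names are assumptions.

```python
# Sketch of a cached graph factory: build once, reuse on every request.
from functools import lru_cache


def _build_graph() -> object:
    # Expensive one-time construction: add the agent node, the tool node,
    # the conditional edges, then compile the graph.
    return object()


@lru_cache(maxsize=1)
def get_compiled_graph() -> object:
    return _build_graph()
```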
The system exhibits clear layering:
- Presentation depends on business logic
- Business logic depends on agents and the LLM abstraction
- Configuration is consumed across layers
- The MCP server reuses the LLM abstraction and tools
- Async-first design: route handlers and services use async to handle concurrent requests efficiently.
- Cached graph compilation: reduces repeated graph-building costs in the React agent.
- Minimal synchronous work in hot paths; heavy operations are offloaded to external services.
- Environment-driven tuning: adjust the debug level and provider/model settings via environment variables.
- Missing environment variables: ensure required keys (e.g., provider API keys and base URLs) are set; the LLM initializer raises explicit errors when they are missing.
- Router-level validation: routers check required fields and return 400 for invalid requests.
- Service-level errors: services catch exceptions and return user-friendly messages; routers map unexpected errors to 500.
- Logging: configure the logging level via environment variables; use module loggers to trace execution paths.
The backend employs a layered architecture with clear separation between presentation, business logic, orchestration, and infrastructure. FastAPI’s modular routers expose domain-specific endpoints under prefixed namespaces, while dependency injection keeps endpoints thin. The LLM abstraction enables multi-provider support and environment-driven configuration. Asynchronous services and cached graph compilation optimize concurrency and performance. The MCP server extends functionality externally, and robust error handling ensures predictable responses.
## Endpoint Catalog
- Health: GET /api/genai/health
- GitHub: GET /api/genai/github
- Website: POST /api/genai/website
- YouTube: GET /api/genai/youtube
- Google Search: GET /api/google-search
- Gmail: GET /api/gmail
- Calendar: GET /api/calendar
- PyJIIT: GET /api/pyjiit
- React Agent: POST /api/genai/react
- Website Validator: GET /api/validator
- Agent: POST /api/agent
- File Upload: POST /api/upload
## Startup and Runtime
- The entry point accepts mode flags and runs either the API or the MCP server.
- The API server uses Uvicorn with host/port from configuration.
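The Uvicorn runner can be sketched as below; the `"api.main:app"` import string and the lazy import are assumptions about how `api/run.py` is organized.

```python
# Sketch of a Uvicorn runner taking host/port from configuration.
def run_api(host: str = "0.0.0.0", port: int = 8000, reload: bool = False) -> None:
    # Imported lazily so MCP mode does not require uvicorn to be installed.
    import uvicorn
    uvicorn.run("api.main:app", host=host, port=port, reload=reload)
```

In the real entry point, `host` and `port` would come from the environment-driven configuration rather than hard-coded defaults.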
## Scripts and Entrypoints
Project scripts expose CLI commands for running API and MCP servers.